539 research outputs found

    Robust Linear Models for Cis-eQTL Analysis

    Get PDF
    Expression Quantitative Trait Loci (eQTL) analysis enables characterisation of functional genetic variation influencing expression levels of individual genes. In outbread populations, including humans, eQTLs are commonly analysed using the conventional linear model, adjusting for relevant covariates, assuming an allelic dosage model and a Gaussian error term. However, gene expression data generally have noise that induces heavy-tailed errors relative to the Gaussian distribution and often include atypical observations, or outliers. Such departures from modelling assumptions can lead to an increased rate of type II errors (false negatives), and to some extent also type I errors (false positives). Careful model checking can reduce the risk of type-I errors but often not type II errors, since it is generally too time-consuming to carefully check all models with a non-significant effect in large-scale and genome-wide studies. Here we propose the application of a robust linear model for eQTL analysis to reduce adverse effects of deviations from the assumption of Gaussian residuals. We present results from a simulation study as well as results from the analysis of real eQTL data sets. Our findings suggest that in many situations robust models have the potential to provide more reliable eQTL results compared to conventional linear models, particularly in respect to reducing type II errors due to non-Gaussian noise. Post-genomic data, such as that generated in genome-wide eQTL studies, are often noisy and frequently contain atypical observations. Robust statistical models have the potential to provide more reliable results and increased statistical power under non-Gaussian conditions. The results presented here suggest that robust models should be considered routinely alongside other commonly used methodologies for eQTL analysis.NonePublishe

    Identification of novel putative rheumatoid arthritis susceptibility genes via analysis of rare variants

    Get PDF
    Established loci for rheumatoid arthritis (RA), including HLA-DRB1 and PTPN22, do not fully account for the genetic component of susceptibility to the disease. One possible source of as yet undiscovered susceptibility genes are those mediated through effects of rare variants. We present a novel method for gene-based genome-wide scans of whole-genome association (WGA) data to identify accumulations of rare variants associated with disease. We apply our method to WGA SNP genotype data obtained from 868 RA cases and 1194 controls. Our results highlight novel putative RA susceptibility genes that have not previously been identified in large-scale WGA studies

    Towards Deep Cellular Phenotyping in Placental Histology

    Full text link
    The placenta is a complex organ, playing multiple roles during fetal development. Very little is known about the association between placental morphological abnormalities and fetal physiology. In this work, we present an open sourced, computationally tractable deep learning pipeline to analyse placenta histology at the level of the cell. By utilising two deep Convolutional Neural Network architectures and transfer learning, we can robustly localise and classify placental cells within five classes with an accuracy of 89%. Furthermore, we learn deep embeddings encoding phenotypic knowledge that is capable of both stratifying five distinct cell populations and learn intraclass phenotypic variance. We envisage that the automation of this pipeline to population scale studies of placenta histology has the potential to improve our understanding of basic cellular placental biology and its variations, particularly its role in predicting adverse birth outcomes.Comment: Updated MRC funding material. Corrected typo that suggested ensembling and Inception accuracy were the same (updated to reflect the fact the ensemble model is 1% better than previously reported

    Genome-wide association study of adipocyte lipolysis in the GENetics of adipocyte lipolysis (GENiAL) cohort.

    Get PDF
    OBJECTIVES:Lipolysis, hydrolysis of triglycerides to fatty acids in adipocytes, is tightly regulated, poorly understood, and, if perturbed, can lead to metabolic diseases including obesity and type 2 diabetes. The goal of this study was to identify the genetic regulators of lipolysis and elucidate their molecular mechanisms. METHODS:Adipocytes from abdominal subcutaneous adipose tissue biopsies were isolated and were incubated without (spontaneous lipolysis) or with a catecholamine (stimulated lipolysis) to analyze lipolysis. DNA was extracted and genome-wide genotyping and imputation conducted. After quality control, 939 samples with genetic and lipolysis data were available. Genome-wide association studies of spontaneous and stimulated lipolysis were conducted. Subsequent in vitro gene expression analyses were used to identify candidate genes and explore their regulation of adipose tissue biology. RESULTS:One locus on chromosome 19 demonstrated genome-wide significance with spontaneous lipolysis. 60 loci showed suggestive associations with spontaneous or stimulated lipolysis, of which many influenced both traits. In the chromosome 19 locus, only HIF3A was expressed in the adipocytes and displayed genotype-dependent gene expression. HIF3A knockdown in vitro increased lipolysis and the expression of key lipolysis-regulating genes. CONCLUSIONS:In conclusion, we identified a genetic regulator of spontaneous lipolysis and provided evidence of HIF3A as a novel key regulator of lipolysis in subcutaneous adipocytes as the mechanism through which the locus influences adipose tissue biology

    Variability of gene expression profiles in human blood and lymphoblastoid cell lines

    Get PDF
    BACKGROUND: Readily accessible samples such as peripheral blood or cell lines are increasingly being used in large cohorts to characterise gene expression differences between a patient group and healthy controls. However, cell and RNA isolation procedures and the variety of cell types that make up whole blood can affect gene expression measurements. We therefore systematically investigated global gene expression profiles in peripheral blood from six individuals collected during two visits by comparing five of the following cell and RNA isolation methods: whole blood (PAXgene), peripheral blood mononuclear cells (PBMCs), lymphoblastoid cell lines (LCLs), CD19 and CD20 specific B-cell subsets. RESULTS: Gene expression measurements were clearly discriminated by isolation method although the reproducibility was high for all methods (range rho = 0.90-1.00). The PAXgene samples showed a decrease in the number of expressed genes (P < 1*10(-16)) with higher variability (P < 1*10(-16)) compared to the other methods. Differentially expressed probes between PAXgene and PBMCs were correlated with the number of monocytes, lymphocytes, neutrophils or erythrocytes. The correlations (rho = 0.83; rho = 0.79) of the expression levels of detected probes between LCLs and B-cell subsets were much lower compared to the two B-cell isolation methods (rho = 0.98). Gene ontology analysis of detected genes showed that genes involved in inflammatory responses are enriched in B-cells CD19 and CD20 whereas genes involved in alcohol metabolic process and the cell cycle were enriched in LCLs. CONCLUSION: Gene expression profiles in blood-based samples are strongly dependent on the predominant constituent cell type(s) and RNA isolation method. It is crucial to understand the differences and variability of gene expression measurements between cell and RNA isolation procedures, and their relevance to disease processes, before application in large clinical studies

    Interactions between Glutathione S-Transferase P1, Tumor Necrosis Factor, and Traffic-Related Air Pollution for Development of Childhood Allergic Disease

    Get PDF
    BACKGROUND: Air pollutants may induce airway inflammation and sensitization due to generation of reactive oxygen species. The genetic background to these mechanisms could be important effect modifiers. OBJECTIVE: Our goal was to assess interactions between exposure to air pollution and single nucleotide polymorphisms (SNPs) in the beta2-adrenergic receptor (ADRB2), glutathione S-transferase P1 (GSTP1), and tumor necrosis factor (TNF) genes for development of childhood allergic disease. METHODS: In a birth cohort originally of 4,089 children, we assessed air pollution from local traffic using nitrogen oxides (traffic NO(x)) as an indicator based on emission databases and dispersion modeling and estimated individual exposure through geocoding of home addresses. We measured peak expiratory flow rates and specific IgE for inhalant and food allergens at 4 years of age, and selected children with asthma symptoms up to 4 years of age (n = 542) and controls (n = 542) for genotyping. RESULTS: Interaction effects on allergic sensitization were indicated between several GSTP1 SNPs and traffic NO(x) exposure during the first year of life (p(nominal) < 0.001-0.06). Children with Ile105Val/Val105Val genotypes were at increased risk of sensitization to any allergen when exposed to elevated levels of traffic NO(x) (for a difference between the 5th and 95th percentile of exposure: odds ratio = 2.4; 95% confidence interval, 1.0-5.3). In children with TNF-308 GA/AA genotypes, the GSTP1-NO(x) interaction effect was even more pronounced. We observed no conclusive interaction effects for ADRB2. CONCLUSION: The effect of air pollution from traffic on childhood allergy appears to be modified by GSTP1 and TNF variants, supporting a role of genes controlling the antioxidative system and inflammatory response in allergy

    Sex differences in the risk of coronary heart disease associated with type 2 diabetes:a Mendelian Randomization analysis

    Get PDF
    OBJECTIVE Observational studies have demonstrated that type 2 diabetes is a stronger risk factor for coronary heart disease (CHD) in women compared with men. However, it is not clear whether this reflects a sex differential in the causal effect of diabetes on CHD risk or results from sex-specific residual confounding. RESEARCH DESIGN AND METHODS Using 270 single nucleotide polymorphisms (SNPs) for type 2 diabetes identified in a type 2 diabetes genome-wide association study, we performed a sex-stratified Mendelian randomization (MR) study of type 2 diabetes and CHD using individual participant data in UK Biobank (251,420 women and 212,049 men). Weighted median, MR-Egger, MR-pleiotropy residual sum and outlier, and radial MR from summary-level analyses were used for pleiotropy assessment. RESULTS MR analyses showed that genetic risk of type 2 diabetes increased the odds of CHD for women (odds ratio 1.13 [95% CI 1.08–1.18] per 1-log unit increase in odds of type 2 diabetes) and men (1.21 [1.17–1.26] per 1-log unit increase in odds of type 2 diabetes). Sensitivity analyses showed some evidence of directional pleiotropy; however, results were similar after correction for outlier SNPs. CONCLUSIONS This MR analysis supports a causal effect of genetic liability to type 2 diabetes on risk of CHD that is not stronger for women than men. Assuming a lack of bias, these findings suggest that the prevention and management of type 2 diabetes for CHD risk reduction is of equal priority in both sexes

    Large-scale meta-analysis highlights the hypothalamic–pituitary–gonadal axis in the genetic regulation of menstrual cycle length

    Get PDF
    The normal menstrual cycle requires a delicate interplay between the hypothalamus, pituitary and ovary. Therefore, its length is an important indicator of female reproductive health. Menstrual cycle length has been shown to be partially controlled by genetic factors, especially in the follicle-stimulating hormone beta-subunit (FSHB) locus. A genome-wide association study meta-analysis of menstrual cycle length in 44 871 women of European ancestry confirmed the previously observed association with the FSHB locus and identified four additional novel signals in, or near, the GNRH1, PGR, NR5A2 and INS-IGF2 genes. These findings not only confirm the role of the hypothalamic–pituitary–gonadal axis in the genetic regulation of menstrual cycle length but also highlight potential novel local regulatory mechanisms, such as those mediated by IGF2

    MicroRNA Expression in Abdominal and Gluteal Adipose Tissue Is Associated with mRNA Expression Levels and Partly Genetically Driven

    Get PDF
    To understand how miRNAs contribute to the molecular phenotype of adipose tissues and related traits, we performed global miRNA expression profiling in subcutaneous abdominal and gluteal adipose tissue of 70 human subjects and characterised which miRNAs were differentially expressed between these tissues. We found that 12% of the miRNAs were significantly differentially expressed between abdominal and gluteal adipose tissue (FDR adjusted p<0.05) in the primary study, of which 59 replicated in a follow-up study of 40 additional subjects. Further, 14 miRNAs were found to be associated with metabolic syndrome case-control status in abdominal tissue and three of these replicated (primary study: FDR adjusted p<0.05, replication: p<0.05 and directionally consistent effect). Genome-wide genotyping was performed in the 70 subjects to enable miRNA expression quantitative trait loci (eQTL) analysis. Candidate miRNA eQTLs were followed-up in the additional 40 subjects and six significant, independent cis-located miRNA eQTLs (primary study: p<0.001; replication: p<0.05 and directionally consistent effect) were identified. Finally, global mRNA expression profiling was performed in both tissues to enable association analysis between miRNA and target mRNA expression levels. We find 22% miRNAs in abdominal and 9% miRNAs in gluteal adipose tissue with expression levels significantly associated with the expression of corresponding target mRNAs (FDR adjusted p<0.05). Taken together, our results indicate a clear difference in the miRNA molecular phenotypic profile of abdominal and gluteal adipose tissue, that the expressions of some miRNAs are influenced by cis-located genetic variants and that miRNAs are associated with expression levels of their predicted mRNA targets
    corecore